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DETAILED ACTION 

1. Claims 1, 2, 8, 9, 14, 16, 20, 21 and 26-35 are currently pending. 

Continued Examination Under 37 CFR 1.114 

2. A request for continued examination under 37 CFR 1.114, including the fee set 
forth in 37 CFR 1 .17(e), was filed in this application after final rejection. Since this 
application is eligible for continued examination under 37 CFR 1.114, and the fee set 
forth in 37 CFR 1 .17(e) has been timely paid, the finality of the previous Office action 
has been withdrawn pursuant to 37 CFR 1 .1 14. Applicant's submission filed on 18 
February 2009 has been entered. 

Claim Rejections - 35 USC § 103 

3. The following is a quotation of 35 U.S.C. 103(a) which forms the basis for all 
obviousness rejections set forth in this Office action: 

(a) A patent may not be obtained though the invention is not identically disclosed or described as set 
forth in section 102 of this title, if the differences between the subject matter sought to be patented and 
the prior art are such that the subject matter as a whole would have been obvious at the time the 
invention was made to a person having ordinary skill in the art to which said subject matter pertains. 
Patentability shall not be negatived by the manner in which the invention was made. 

4. This application currently names joint inventors. In considering patentability of 
the claims under 35 U.S.C. 103(a), the examiner presumes that the subject matter of 
the various claims was commonly owned at the time any inventions covered therein 
were made absent any evidence to the contrary. Applicant is advised of the obligation 
under 37 CFR 1 .56 to point out the inventor and invention dates of each claim that was 
not commonly owned at the time a later invention was made in order for the examiner to 
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consider the applicability of 35 U.S.C. 103(c) and potential 35 U.S.C. 102(e), (f) or (g) 
prior art under 35 U.S.C. 103(a). 

5. Claims 1, 2, 8, 9, 14, 16, 20 and 21 are rejected under 35 U.S.C. 103(a) as 
being unpatentable over US Patent No. 6,584,464 to Warthen (hereafter Warthen) 
in view of US Patent No 6,493,721 to Getchius et al (hereafter Getchius) in view of 
US PGPub 2004/0199498 to Kapur et al (hereafter Kapur) in view of US PGPub 
2004/0260677 to Malpani et al (hereafter Malpani). 

Referring to claim 1, Warthen discloses a method of compressing a log of 
natural language data (see column 4, lines 65-66), the log having a plurality of natural 
language help query strings [questions] (see column 4, lines 38-41) relative to a help 
function of a computer system, each help query string including at least two words [i.e.. 
Where can I find information on the sport bicycling?] (see column 4, lines 32-36). 
However, Warthen fails to explicitly disclose the further limitations of the actual steps 
taken to compress the log. Getchius discloses optimizing search queries (see abstract), 
including the further limitation of applying a subsumption operation to each string, 
wherein the subsumption operation identifies a single word difference between the 
query string and another query string (see column 24, lines 7-22; and column 41, line 58 
- column 42, line 15). 

It would have been obvious to one of ordinary skill in the art at the time of the 
invention to utilize the concept of subsumption as disclosed by Getchius as the type of 
compression performed by Warthen. One would have been motivated to do so in order 
to increase the accuracy of determining if two strings are duplicates. 
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The combination of Warthen and Getchius (hereafter Warthen/Getchius) fails 
to explicitly disclose the further limitations of removing one of the strings and training a 
statistical process. Kapur discloses receiving various query log files from various 
sources and then processing the logs (see [0035], lines 1-8), including the further 
limitations of: applying a compression operation [canonicalized] to each string (see 
[0036], lines 3-5); and removing one of the two query strings from the log to form a 
compressed log [multiple occurrences of the same query are included as a single query] 
(see [0035], lines 10-13 and Fig 5, step 510). 

It would have been obvious to one of ordinary skill in the art at the time of the 
invention to use the query processing engine disclosed by Kapur/Getchius to compress 
the log of questions disclosed by Warthen. One would have been motivated to do so in 
order to produce a set of questions which improve the process of determining the 
context of a user query and then associating the most useful result with the query 
(Warthen: see column 1 , lines 43-51) in order to produce a set of questions which 
improve the process of determining the context of a user query and then associating the 
most useful result with the query. 

The combination of Warthen/Getchius and Kapur (hereafter 
Warthen/Getchius/Kapur) fails to explicitly disclose the further limitation of training a 
statistical process with the log. Malpani discloses search category classification (see 
abstract), including the further limitation of training a statistical process [classification 
component 120 which implements a statistical model] with the log [training data source] 
(see [0030] and [0031]). 
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It would have been obvious to one of ordinary skill in the art at the time the 
invention was made to utilize the compressed log of Warthen/Getchius/Kapur to train a 
statistical process as disclosed by Malpani since the classification component of 
Malpani can be trained from one or more of a number of sources (Malpani: see [0021]). 
One would have been motivated to do so in order to increase search providers in 
understanding user intent and providing optimal search results and content to querying 
users (Kapur: see [0005] and [0006]). 

Referring to claim 2, the combination of Warthen/Getchius/Kapur and Malpani 
(hereafter Warthen/Getchius/Kapur/Malpani) discloses the method of claim 1, wherein 
the log is a log of user-initiated inputs [users' questions] to a help interface [client 
interface] (Warthen: see column 3, lines 59-67). 

Referring to claim 8, Warthen/Kapur/Malpani/Getchius discloses the method of 
claim 1, wherein subsumption includes applying an impossibility condition to selectively 
compute edit distance [edit distance] (Getchius: see column 41, line 58 - column 42, 
line 15 and Kapur: see [0048], lines 13-31). 

Referring to claim 9, Warthen/Getchius/Kapur/Malpani discloses the method of 
claim 1, and further comprising: 

applying a second subsumption operation [tokenized] to each help query sthng 
(Kapur: see [0036], lines 23-28); 

determining if any two strings match each other after the second subsumption 
operation [convergence] (Kapur: see [0038], lines 1-2); and 



Application/Control Number: 10/796,644 Page 6 

Art Unit: 2167 

removing one of tlie two lielp query strings from tlie log (Kapur: see [0038], lines 

5-6). 

Referring to claim 14, Warthen discloses a system for compressing a query log 
having a plurality of linguistic help query strings [questions] (see column 4, lines 38-41 
and 65-66), each string having a plurality of words [i.e.. Where can I find information on 
the sport bicycling?] (see column 4, lines 32-36). 

However, Warthen fails to explicitly disclose the further limitations of the actual 
steps taken to compress the log. Getchius discloses optimizing search queries (see 
abstract), Including the further limitation of applying a subsumption operation to each 
string, wherein the subsumption operation identifies a single word difference between 
the query string and another query string (see column 24, lines 7-22; and column 41 , 
line 58 - column 42, line 15). 

It would have been obvious to one of ordinary skill In the art at the time of the 
Invention to utilize the concept of subsumption as disclosed by Getchius as the type of 
compression performed by Warthen. One would have been motivated to do so in order 
to increase the accuracy of determining if two strings are duplicates. 

However, Warthen/Getchlus falls to explicitly disclose the further limitations of the 
actual steps taken to compress the log. Kapur discloses receiving various query log 
files from various sources and then processing the logs (see [0035], lines 1-8), Including 
the further limitations of: 

an input for receiving a raw query log of natural language strings relative to a 
help function of a computer (see [0035], lines 3-8); 
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memory [memory or database file 310] for storing tlie raw query log (see [0035], 
lines 19-31); 

a processor [query processing engine 300] (see Fig 3) for applying at least one 
compression operation [canon icalized] (see [0036], lines 3-5), and for removing one of 
the strings [consolidate] (see Fig 5, step 510 and [0035], lines 10-13). 

It would have been obvious to one of ordinary skill in the art at the time of the 
invention to use the query processing engine disclosed by Kapur to compress the log of 
questions disclosed by Warthen/Getchius. One would have been motivated to do so in 
order to produce a set of questions which improve the process of determining the 
context of a user query and then associating the most useful result with the query 
(Warthen: see column 1, lines 43-51). 

Warthen/Getchius/Kapur fails to explicitly disclose the further limitation of 
wherein the processor is configured to utilize the query log to train a statistical process. 
Malpani discloses search category classification (see abstract), including the further 
limitation of wherein the processor is configured to utilize the query log [training data 
source] to train a statistical process [classification component 120 which implements a 
statistical model] (see [0030] and [0031]). 

It would have been obvious to one of ordinary skill in the art at the time the 
invention was made to utilize the query log of Warthen/Kapur to train a statistical 
process as disclosed by Malpani since the classification component of Malpani can be 
trained from one or more of a number of sources (Malpani: see [0021]). One would 
have been motivated to do so in order to increase search providers in understanding 
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user intent and providing optimal search results and content to querying users (Kapur: 
see [0005] and [0006]). 

Referring to claim 16, Warthen/Kapur/Malpani discloses the system of claim 15, 
wherein each help-related query is relative to a computer system [corporate network 
answering employee questions] (Warthen: see column 3, lines 59-67). 

Referring to claim 20, Warthen/Getchius/Kapur/Malpani discloses the system of 
claim 19, wherein subsumption includes applying an impossibility condition to 
selectively compute edit distance [edit distance] (Getchius: see column 41 , line 58 - 
column 42, line 15 and Kapur: see [0048], lines 13-31). 

Referring to claim 21, Warthen/Getchius/Kapur/Malpani discloses the system of 
claim 14, and further comprising: 

applying a second subsumption operation [tokenized] to each string (Kapur: see 
[0036], lines 23-28); 

determining if any two strings match each other after the second subsumption 
operation [convergence] (Kapur: see [0038], lines 1-2); and 

removing one of the two matching strings from the log (see [0038], lines 5-6). 



6. Claims 26-35 are rejected under 35 U.S.C. 103(a) as being unpatentable 
over US Patent No. 6,584,464 to Warthen (hereafter Warthen) in view of US Patent 
No 6,493,721 to Getchius et al in view of US PGPub 2004/0199498 to Kapur et al in 
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view of US PGPub 2004/0260677 to Malpani et al as applied to claims 1 and 14 
above respectively, and further in view of US PGPub 20040059736 to Willse et al 
(hereafter Willse). 

Referring to claims 26 and 31, Warthen/Getchius/Kapur/Malpani fails to 
explicitly disclose the further limitation wherein the method further comprises discarding 
the additional word and collapsing the pair of help query strings if the additional word 
does not significantly change the meaning. Willse discloses subsumption, including the 
further limitation wherein the method further comprises discarding the additional word 
and collapsing the pair of help query strings if the additional word does not significantly 
change the meaning (see [0071]). 

It would have been obvious to one of ordinary skill in the art to apply the 
subsumption operations of Willse to those of Warthen/Getchius/Kapur/Malpani. One 
would have been motivated to do so in order to decrease the amount of memory 
required to store the log through compression of the log. 

Referring to claims 27 and 32, the combination of 
Warthen/Getchius/Kapur/Malpani and Willse discloses the method of claim 26, wherein 
the subsumption operation includes a statistical operation relative to the additional word 
(Willse: see [0111]). 

Referring to claims 28 and 33, Warthen/Getchius/Kapur/Malpani fails to 
explicitly disclose the further limitation wherein the subsumption operation is absolute 
between an N word help query string and an (N-1) word help query string. Willse 
discloses subsumption, including the further limitation wherein the subsumption 
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operation is absolute between an N word help query string and an (N-1) word help 
query string (see [0071] and [0073]). 

It would have been obvious to one of ordinary skill in the art to apply the 
subsumption operations of Willse to those of Warthen/Getchius/Kapur/Malpani. One 
would have been motivated to do so in order to decrease the amount of memory 
required to store the log through compression of the log. 

Referring to claims 29 and 34, Warthen/Getchius/Kapur/Malpani fails to 
explicitly disclose the further limitation wherein the subsumption operation is guided by 
vocabulary features. Willse discloses subsumption, including the further limitation 
wherein the subsumption operation is guided by vocabulary features [vocabulary 
taxonomy] (see [0067]). 

It would have been obvious to one of ordinary skill in the art to apply the 
subsumption operations of Willse to those of Warthen/Getchius/Kapur/Malpani. One 
would have been motivated to do so in order to decrease the amount of memory 
required to store the log through compression of the log. 

Referring to claims 30 and 35, Warthen/Getchius/Kapur/Malpani fails to 
explicitly disclose the further limitation wherein subsumption is blocked if the additional 
word is in a control vocabulary. Willse discloses subsumption, including the further 
limitation wherein subsumption is blocked if the additional word is in a control 
vocabulary (see [0067]). 

It would have been obvious to one of ordinary skill in the art to apply the 
subsumption operations of Willse to those of Warthen/Getchius/Kapur/Malpani. One 
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would have been motivated to do so in order to decrease the amount of memory 
required to store the log through compression of the log. 



Response to Arguments 

7. Applicant's arguments filed on page 7 of the Remarks in regards to Getchius 
have been fully considered but they are not persuasive. The examiner respectfully 
disagrees that Getchius fails to teach the subsumption in the same manner as what is 
claimed by the limitation. Getchius deals with subsets and supersets. Also, Getchius 
discusses edit distance. The edit distance is the difference between two strings. 
Therefore the edit distance takes care of there being a difference between a single word 
in the two queries. 
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Contact Information 

Any inquiry concerning tliis communication or earlier communications from tine 
examiner should be directed to Kimberly Lovel whose telephone number is (571) 272- 
2750. The examiner can normally be reached on 8:00 - 4:00. 

If attempts to reach the examiner by telephone are unsuccessful, the examiner's 
supervisor, John Cottingham can be reached on (571) 272-7079. The fax phone 
number for the organization where this application or proceeding is assigned is 571- 
273-8300. 

Information regarding the status of an application may be obtained from the 
Patent Application Information Retrieval (PAIR) system. Status information for 
published applications may be obtained from either Private PAIR or Public PAIR. 
Status information for unpublished applications is available through Private PAIR only. 
For more information about the PAIR system, see http://pair-direct.uspto.gov. Should 
you have questions on access to the Private PAIR system, contact the Electronic 
Business Center (EBC) at 866-217-9197 (toll-free). If you would like assistance from a 
USPTO Customer Service Representative or access to the automated information 
system, call 800-786-9199 (IN USA OR CANADA) or 571-272-1000. 

/John R. Cottingham/ /Kimberly Lovel/ 

Supervisory Patent Examiner, Art Unit 21 67 Examiner 

Art Unit 2167 

10 May 2009 
/kml/ 
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